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Holder regularity for a non-linear parabolic equation driven by 

space-time white noise 
F. Otto, H. Weber 

Abstract. We consider the non-linear equation T~ 1 u + dtU — 8 2 tt{u) = £ 
driven by space-time white noise £, which is uniformly parabolic because 
we assume that n' is bounded away from zero and infinity. Under the fur¬ 
ther assumption of Lipschitz continuity of 7 r' we show that the stationary 
solution is — as for the linear case — almost surely Holder continuous 
with exponent a for any a < | w. r. t. the parabolic metric. More pre¬ 
cisely, we show that the corresponding local Holder norm has stretched 
exponential moments. 

On the stochastic side, we use a combination of martingale arguments to 
get second moment estimates with concentration of measure arguments 
to upgrade to Gaussian moments. On the deterministic side, we first 
perform a Campanato iteration based on the De Giorgi-Nash Theorem as 
well as finite and infinitesimal versions of the iJ _1 -contraction principle, 
which yields Gaussian moments for a weaker Holder norm. In a second 
step this estimate is improved to the optimal Holder exponent at the 
expense of weakening the integrability to stretched exponential. 

1 Introduction and main result 

We are interested in the stochastic nonlinear parabolic equation 

T _1 n + d t u - d 2 x i t(u) = £, (1) 

where £ denotes space-time white noise. The nonlinear character of (JT]) is that 
of a fully nonlinear equation rather than a quasi-linear equation, since rewrit¬ 
ing © as the quasi-linear equation (JT)) is not helpful as we explain below, 
and since the deterministic estimates we need are related to the linearization 
of a fully nonlinear equation, cf. (TT5T) . rather than to the linearization of a 
quasi-linear equation (this distinction would be more pronounced in a multi¬ 
dimensional case). We assume that the nonlinearity 7 r is uniformly elliptic 
in the sense that there exists a A > 0 such that 

A < 7 r'(u) < 1 for all u E M. (2) 

In particular, this rules out the degenerate case that goes under the name of 
porous medium equation. Furthermore, we assume some regularity of it in 
the sense that there exists L < 00 such that 

\tt"(u)\ < L for all u E M. (3) 
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We are interested in Holder regularity of solutions of (JT]); the simplest solution 
to (JT|) is the space-time stationary solution u of ([T]) on which we shall focus 
in this paper. The main reason for including the massive term in (JT]) (i. e. 
assuming T < oo) is to ensure existence and uniqueness of this object; the 
only other role is to provide a large-scale estimate through Lemma [TJ. In 
this version of the paper, we will be completely informal about why and in 
which sense (JT]) is well-posed, and why the martingale and concentration of 
measure arguments can be carried out (we will just motivate them when we 
first need them). 

A crucial insight is that the law of the (unique) stationary random held u is 
invariant under the rescaling 

x = Rx, t = R 2 t, u = R^u, (4) 

provided, the nonlinearity and the massive term are adjusted according to 

7r(u) = R~2 tt(Rzu), T = R~ 2 T. (5) 

For this observation we used that in view of its defining relation 


< 



/ 


( 2 dxdt 


for atest function ( (that is, loosely speaking J ( (£(t, x)£(0, 0 ))dxdt = £(0, 0)), 
space-time white noise rescales as f = A — = R~%f. From this invariance 

v RR Z 

property we learn that as we go to small scales (i. e. R - Cl), the effective 
nonlinearity as measured by the Lipschitz constant L of n' in ([3]) decreases 
according to 

L = RhL. (6) 

This suggests that on small scales, u has the same regularity as if ([T]) were 
replaced by its linear version (without massive term) dtu — aod 2 u = f for 
some constant ao G [A, 1]. Hence we expect that on small scales, u is Holder 
continuous with exponents a (in the parabolic Carnot-Caratheodory geome¬ 
try) for any a < This is exactly what we show, making crucial use of the 
above scale invariance. 

We note in passing that it is not helpful to write the elliptic operator in the 
more symmetric form 

T~ l u + d t u - d x (7r'(u)d x u) = £, (7) 

since even in case of the stochastic heat equation, u (and thus n '(«)) is 
a function in the Holder space with exponent A— so that d x u would be 
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a distribution in the (negative) Holder space with exponent — , so that 

there is no standard distributional definition of the product n'{u)d x u. In 
fact, rather than appealing to regularity theory for linear but non-constant 
coefficient equations of the form T" x u + d t u — d x (ad x u ) = d x g, we have to 
appeal to the theory for T^w + d t w — d x (aw) = d x g , cf. Proposition [3j 

Let us now briefly comment on existing regularity theory for non-linear 
parabolic stochastic differential equation of the type of 0. There is a large 
body of literature on stochastic equations of the type dH), but mostly with 
a quite different focus: The focus there is to tackle on the one hand more 
nonlinear situations, like the case of a degenerate ellipticity (i. e. A = 0 in 
0 ) or the case of multiplicative noise, but on the other hand to assume 
“whatever it takes” on the spatial covariance structure of the noise. Some¬ 
times, structural assumptions allow to mimic an approach that is obvious in 
the semi-linear case, namely the approach of decomposing the solution into 
a rough part w that solves a more explicitly treatable stochastic differential 
equation and a more regular part v that solves a parabolic equation with 
random coefficients and/or right-hand-side described through w, and then 
allows for an application of deterministic regularity theory. We refer to [7J 
for an example with a multiplicative decomposition of this type. The recent 
work by Debussche et. al. on quasi-linear parabolic stochastic equations, i. e. 
equations of the form (0 or more generally with an elliptic operator of the 
form —V • a(tt)Vu, refines this approach to a fixed point argument, and ap¬ 
peals to the De Giorgi-Nash Theorem, which yields a Holder a priori bound 
on linear parabolic equations with just uniformly elliptic coefficients as a 
starting point to bootstrap to the optimal Holder continuity via Schauder 
theory, see jU Introduction], However, cf. the above discussion of 0, this 
treatment seems limited to situations where the noise £ is so regular that in 
the case of the linear equation, Vu is at least locally integrable in time-space 
(to be more quantitative: f | Vu\ p dxdt < oo for some p > 3 on the level of 
space-time isotropic LP- norms). 

By the equivalence of Campanato and Hdlder spaces, see for instance [SJ 
Theorem 5.5], Holder continuity can be expressed in terms of a localized Lr- 
modulus of continuity. Because of the eventual conditioning on the distant 
noise, it is more convenient to replace a sharp spatial localization on parabolic 
cylinders by a soft localization via an exponentially decaying function 

1 \ X 

V(x) = - exp(—|x|), r] r (x ) := -t?(-), (8) 

note that the normalization imply that f rj r ■ dx corresponds to a spatial 
average that is localized near the origin on scale r. We note that while the 
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exponential form of the cut-off is probably not essential (any thicker than 
Gaussian tails should suffice), it is convenient at many places of the proof. 
Abbreviating the L 2 -modulus of continuity at the origin and on parabolic 
scale r by 

D 2 (u,r ) := -f / r) r (u — 4 - / rj r u) 2 dxdt , 


our first main result reads as follows: 


Theorem 1. Let u be the unique stationary solution to ([I]). W. 1. o. g. 
suppose that T = 1 in (QJ). There exists a Holder exponent a 0 E (0, |) 
depending only on X, so that we have a Gaussian bound for the ao-Holder 
L 2 -averaged modulus of continuity at the origin in the sense that 



with a constant C < oo only depending on X > 0, L < oo and «o- 

The Holder exponent a 0 E (0, |) is determined by an application of the 
celebrated De Giorgi-Nash Theorem via Proposition [0 below (in fact the 
a o in Theorem Q] is slightly smaller than the «o hi Proposition Q]). It only 
depends on the ellipticity ratio A. In our second main result we improve the 
Holder regularity exponent up to the optimal value a = A— at the expense 
of weakening the integrability. 

Theorem 2. Let u be the unique stationary solution to (JT]) for T = 1. Let 
cco £ (0, |) be the Holder exponent appearing in Theorem[ 7J Then for any 
Holder exponent a E (ao> |) we 9 e t 

2 \ 2 0:0 

( exp (cP<?G D( “' r) ) j)- 2 ’ (10) 

with a constant C < oo only depending on X > 0, L < oo and a < 

Theorems Q] and [2] imply bounds for the more conventional local Holder semi¬ 
norms of the random field u. For any a E (0,1) we set 

[u] a = sup sup \u(t,x) -u(s,y)\. (11) 

Re( 0,1) (t,x),( s,y)e (- l,0)x(—1,1) 

y/\t-s\+\x-y\<R 

Theorem [2] implies the following: 

Corollary 1. Under the assumptions of Theorem^ we have 



for a constant C < oo which only depends on X > 0, L < oo, a < \ and 
e > 0. 
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2 Strategy of proof and ingredients 

Theorem [TJ like Lemma Q] below, relies on a concentration of measure ar¬ 
gument for Lipschitz random variables: For any a random variable F that 
is 1-Lipschitz when considered as a path-wise functional of the white noise 
£, one has (exp(AF)) < exp(A(F) + ^A 2 ) for any number A. In particu¬ 
lar, if F > 0 is 1-Lipschitz and satisfies (F) < 1, it has Gaussian moments 
(exp(^F 2 )) < 2, for some universal constant C < oo. Here the norm un¬ 
derlying the Lipschitz property is the norm of the Cameron-Martin space, 
which simply means that infinitesimal variations h£ of the space-time white 
noise are measured in the space-time L 2 -norm. To continue with the name- 
dropping, this type of Lipschitz continuity means that the carre-du-champs 
|VF| 2 of the Malliavin derivative is bounded independently of the given re¬ 
alization of the noise, where for a given realization £ of the noise, | VF| is the 
smallest constant A in 

/ I 

m 2 dxdt)\ ( 12 ) 

Here SF denotes the infinitesimal variation of F generated by the infinites¬ 
imal variation <5£ of the noise £, a linear relation captured by the Frechet 
derivative (a linear form) of F w. r. t. £. For those not confident in this 
continuum version of concentration of measure we derive it from the discrete 
case in the proof of Lemma HI where also the type of martingale arguments 
entering Proposition [Tj via Lemma[2](and Lemmas [3] and H] again) is explained 
for the non-expert. 

Both in the proofs of Theorem |T] and Theorem [2j concentration of measure 
will be applied to the random variable F = D(u,r). It is Proposition [2] 
which provides the bound on the Malliavin derivative w. r. t. to the ensemble 
(•)i that describes the space time white noise £ restricted to the time slice 
(t,x) G (—1,0) x R. In particular, this means that the admissible variations 
h£ in (fT2|) are supported in (t,x) G (—1,0) x R; we denote by |VF| 2 the 
corresponding carre-du-champs. Proposition Q] in turn provides the estimate 
of the (conditional) expectation, that is, the expectation in (-)i which is used 
in Theorem [TJ while the proof of Theorem [2] relies on Proposition [5j 
Proposition [1] provides a bound on the expectation of Tj- D(u,r ) in terms 
of quantities that are linear in D'(u, 1) (which roughly behaves as D(u, 1)) 
and in combination with Proposition [2] this bound can be upgraded to Gaus¬ 
sian moments. Using the scale invariance ([1J) and ([5]) this estimate can then 
be used in a (stochastic) Campanato iteration leading to Theorem [TJ The 
drawback of Proposition |Tj is that it is restricted to the small Holder expo¬ 
nent ao because the proof relies on the De Giorgi-Nash Theorem. Proposi- 
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tion[3](in conjunction with Proposition [21) in turn yields Gaussian moments 
for ^D(u,r), for all admissible exponents a E (0, |), however only up to 
1 + r^D'(u, 1), which roughly behaves as 1 + r^D(u, 1), and modulo the 
multiplicative (and nonlinear) error of -4-(l + D(u, 1)). Evoking the scale in- 

r 2 

variance (HI) fc (l5l) . this estimate will be used for small scales, where thanks to 
the behavior (jH]) of L , the multiplicative error fades away, so that Theorem [2] 
can be obtained by another (non-linear) Campanato iteration. Theorem [T] is 
needed as an anchoring for this iteration. 

Since these propositions will be applied to small scales, so that in view of ([5]) 
the massive term fades away, we cannot expect help from it; as a matter of 
fact, we will ignore the massive term in the proof (besides in Lemma Q] where 
it is essential). 

Proposition 1. There exists a Holder exponent oo £ (0, |), depending only 
on A, such that we have all r <1 

(■ D(u , r)>! < (l + D\u, 1) + 4 (D\u, 1) + 1)), 

\ f 2 / 

where D'(u, 1) depends only on u(t = —1, •): 

D' 2 (u, 1) := J rj{u - J r)u) 2 dx\ t= -i. (13) 

Here and in the proof, < means up to a constant only depending on A > 0. 
Proposition 2. We have for the carre-du-champs of the Malliavin derivative 

\X7D(u,r)\i < r"i, 

where here and in the proof, < means up to a constant only depending on 

A > 0. 

The only purpose of the presence of the massive term is that in the original 
scale, it provides control of the L 2 -averaged Holder continuity on scales 1, 
and thus the anchoring for the Campanato iteration: 

Lemma 1. Suppose that T = 1 in (QJ). Then we have 

<exp(lo 2 (t.,l))><2 
for some constant C only depending on A. 
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In order to derive Propositions Q] and [21 we will consider differences of so¬ 
lutions to (HJ) for Proposition [U or infinitesimal perturbations of solutions 
for Proposition [2j Finite or infinitesimal differences of solutions satisfy a 
formally linear parabolic equation with an inhomogeneous coefficient field a, 
which in view of (J2J) is uniformly elliptic: 

A < a(t,x ) < 1 for all (t,x) G (—1,0) x M. (14) 

The linearized operator comes in the conservative form of dtu-dl(au). For a 
priori estimates of the corresponding initial value problem, it is most natural 
to write the r. h. s. also in conservative form: 

d t w - d 2 x (aw ) = d t h + dig. (15) 

The L 2 -estimates on solutions of (fT5|) from Proposition [H] might be seen as 
an infinitesimal version of the //^-contraction principle for the deterministic 
counterpart of (JT]) , which will be explicitly used in Lemma [2] see the proof 
of Lemma [T] which is a good starting point for the PDE arguments, too. 


Proposition 3. Consider a solution w of m with r. h. s. described by 
(g,h). In the case h = 0 we get the local estimate 


sup / y/rjw(l — &l) 1 yCjwdx + 
te(—i,o) J 


gw 2 dxdt 


< 

r^j 


y/rjwO- - d x) y/rjwdx |t = -! + 


gg 2 dxdt. 


i 


In the case of general h and homogeneous initial data i.e. if 

w = h = 0 for t = — 1 
we get both the local bound 


gw 2 dxdt < 


7](g 2 + h 2 )dxdt 


'-l, 


and the global bound 


w 2 dxdt < 


(. g 2 + h 2 )dxdt. 




(16) 


(17) 


(18) 


Here and in the proof <C and < refer just to X. 
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However, next to this “soft” a priori estimate for solutions of the initial value 
problem for 0151) . we also need the following “hard” a priori estimate. Well be¬ 
yond the /Abound in Proposition [3] Proposition [4] provides equi-integrability 
of v 2 for a solution of the homogeneous version of (fl5|) in the sense of a Mor- 
rey norm, for the latter see [H Definition 5.1]. It does so in a quite quan¬ 
tified way: (TTTU) provides equi-integrability as if v G L p (in time-space) for 
p = yyb_. > 3 , Loosely speaking, this equi-integrability arises as follows: The 
spatial anti-derivative V of the given solution v of the homogeneous version 
of (fT5j) satisfies the divergence-form, equation d t V — d x (ad x V ) = 0. Now 
the celebrated theory of De Giorgi and Nash, in particular in the parabolic 
version of Nash [IT], implies that for some a 0 > 0 depending only on the 
ellipticity A in f|14|) . the CK 0 -H61der norm of V is controlled by weaker norms 
of V. The connection between Nash’s result and (fH)|) is obvious on the level 
of scaling: The cto-Holder norm of V has the same (parabolic) scaling as the 
ZT-norm of v = d x V with «o = 1 — On the level of equi-integrability of 
( d x V ) 2 , this scaling analogy indeed can be made rigorous with help of the 
parabolic Caccioppoli estimate for V. 

Proposition 4. Let v be a solution to (fT5j) with right hand side h = f = 0. 
Then there exists an a 0 > 0, depending only on A such that for 0 < r < 1 we 
have the estimate 

p r v 2 dxdt < r ~ 2 + 2Q o J y^y(i — d 2 )- 1 y/rjv dx\ t _ r (19) 
Here and in the proof < only refers to A. 

The following proposition provides the non-linear version of Proposition |T] 
which is used in the proof of Theorem [2] In this proposition the restriction 
on the Holder exponents is removed and all exponents a < A are admitted. 

Proposition 5. Pick a Holder exponent a G (0, |). Then we have all r <1 
(■ D(u ,r))i < r“(l + 4 (D'(u, 1) + 1)) (l + r^Dfu, 1)), 

7"2 

where D'(u, 1) is defined in (lT3j) . Here and in the proof, < means up to a 
constant only depending on A > 0 and a < 

For this proposition we follow a standard approach in Schauder theory for 
parabolic (and elliptic) equations and consider (fT5|) with constant coefficients 
Oo G [A, 1], which will arise from locally “freezing” the variable coefficient field 



a: 


3,v - a a dlv = f. 


( 20 ) 



Proposition O states a classical L°° estimate for (12Uj) . the only difficulty com¬ 
ing from the low regularity of the initial data v\ t= -\ and the moderate reg¬ 
ularity of the r. h. s. / assumed in Proposition [6l We give a self-contained 
proof. 

Proposition 6. Consider a solution v of / fddl) with r. h. s. f. Then we have 
a localized L°°-estimate 

sup (t+l)^r]v 2 < f f r]f 2 dxdt + 

(t,a:)E(-l,0)xR J— 1 J 

We’d like to point out a synergy in terms of methods between this approach 
to regularity for stochastic partial differential equations driven by stationary 
noise, and an approach to regularity for elliptic partial differential equations 
with stationary random coefficient field that is emerging over the past years 
D2HHE!- At first glance, the differences dominate: Here, we have a nonlinear 
and parabolic partial differential equation driven by a random right-hand-side 
£, and we hope for almost-sure small-scale regularity despite the short-range 
decorrelation of £, which implies its roughness. There, the main features 
already appear on the level of a linear and elliptic equation, for instance 
on the level of the harmonic coordinates or the corrector given by —V • 
a(V fi + ef) = 0 where e* is the i-tli unit vector, and one hopes for almost-sure 
large scale regularity thanks to the long-range decorrelation of the coefficient 
field a. In the first case, randomness limits Holder regularity, whereas in 
the second case, randomness improves Holder regularity: In fact, for almost 
every realization of a, a-harnronic functions u satisfy a first-order Liouvillc 
principle [9], and even Liouvillc principles of any order [5|, which is the 
simplest way to encode large-scale Holder regularity. Even the lowest-order 
Liouvillc principle is known to fail for some uniformly elliptic and smooth 
coefficient fields a, so that these results indeed show a regularizing effect of 
randomness. 

Despite these obvious differences, the approach is very similar: Both here and 
there (in [T2] and, more explicitly, in [6], [9]) one is appealing to the com¬ 
bination of sensitivity estimates (how do certain functionals of the solution 
depend on the right hand side here, or on the coefficient field there?) mea¬ 
sured in terms of a carre du champs (of the Malliavin derivative here, or of a 
suitable vertical derivative that is compatible with the correlation structure 
there), and then appeals to concentration of measure (on the Gaussian level 
here, or via the intermediate of a Logarithmic Sobolev Inequality there). 

Such a synergy in methods that treat models with thermal noise like in high- 
or infinite dimensional stochastic differential equations with reversible invari¬ 
ant (Gibbs) measure and those that treat models with quenched noise like 
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in stochastic homogenization is not new: In their seminal work on Gradi¬ 
ent Gibbs measures, a model in statistical mechanics that describes ther¬ 
mally fluctuating surfaces, Naddaf and Spencer appeal to stochastic homog¬ 
enization to characterize the large-scale correlation structure of the field [32] . 
Their analysis can also be interpreted as considering the infinite-dimensional 
stochastic differential equation of which the measure is the reversible invari¬ 
ant measure, an equation which can be seen as a spatial discretization of a 
stochastic nonlinear parabolic partial differential equation, and to consider 
the Malliavin derivative of its solution with respect to the (discrete) space- 
time white noise [3]. Again, the nonlinearity is rather of the symmetric form 
© and Naddaf and Spencer appeal to Nash’s heat kernel bounds. 

We close this parenthesis by noting that for stochastic partial differential 
equations and stochastic homogenization, even the deterministic ingredients 
are similar: In both cases, the sensitivity estimate leads to a linear partial 
differential equation (parabolic here, elliptic there) with a priori only uni¬ 
formly elliptic coefficient field (in space-time here, in space there), that is, 
without any a priori modulus of continuity. In both cases, a buckling ar¬ 
gument is needed to obtain bounds on Hblder norms with high stochastic 
integrability. While here, the need of a buckling estimate is obvious since 
the small-scale regularity of the coefficient field a = n'(u) in the sensitivity 
equation is determined by the small-scale regularity of the solution u around 
which one is linearizing, the buckling is less obvious there: It turns out that 
the large-scale regularity properties of the operator —V ■ aV are determined 
by the large-scale properties of the harmonic coordinates ay + fa, the special 
solution mentioned above. Here, buckling proceed by showing that the linear 
operator d t — d\a is close to a constant coefficient operator d t — aod% on small 
scales, there, it proceeds by showing that it is close to a constant coefficient 
operator on large scales, namely the homogenized operator —V • a^V. In 
both cases, a Campanato-type iteration is the appropriate deterministic tool 
for the buckling. Here, this is not surprising since Campanato iteration is 
a robust way of deriving Holder estimates (see for instance [HI Chapter 5]); 
there, the use of Campanato iteration to push the constant-coefficient regu¬ 
larity theory from the infinite scale to large but finite scales was first intro¬ 
duced in [2] in case of periodic homogenization, then transferred to stochastic 
homogenization in |1], and refined in [9] in a way that brings it very close to 
its small-scale application. 

After this aside, we turn back to our proof. Next to these deterministic 
ingredients, Proposition [T] and Proposition [5] also require a couple of classical, 
second moment stochastic estimates. The first lemma provides such a low- 
stochastic moment estimate on the L 2 -Holder-| modulus of continuity, which 
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however is restricted to a spatial modulus and is only localized to scales 
1. This spatial L 2 -Holder modulus of continuity is expressed in terms of 
the L 2 -difference of spatial shifts (which are then exponentially averaged 
over the shifts); this form arises naturally from a martingale version of the 
(deterministic) //“^-contraction principle for equations of the form (JT() with 
uniform cllipticity ([2]). In fact, we use a spatially localized version of the 
//^-contraction principle. 

Lemma 2. Let u denote the stationary solution of (QJ) and denote by u h its 
spatial translation by the shift h e R. Then we have for r <C 1 


J T,r ^ J 
< r + r 2 </ 


r)(u h — u) 2 dxdtdhj 




r)(u— j j rju) 2 dxdtj = r + r 2 (D 2 (u, l))i. 




Here and in the proof < and <C just refer to X. 


The second (very similar) step is to estimate the “bulk” L 2 -modulus on the r. 
h. s. of Lemma [2] by the boundary L 2 -modulus of the initial data u(t = —1, •). 

Lemma 3. The stationary solution u of m satisfies 

(D 2 (u , l))i = (J J V{u- J J rju) 2 dxdt ^ 

I rj{u — j pu) 2 dx\ t =-i = 1 + D'~(u, 1). 


< 1 


( 22 ) 


Here and in the proof < and <C just refer to X. 

The third step is to upgrade the purely spatial /T-averaged Holder-A modulus 
of continuity into a space- time modulus of continuity. 

Lemma 4. The stationary solution u of (Op satisfies for r<l 


( / T] r U - 


rj r u) 2 dxdt 


< 


r 2 + 


o 


I rj r (u — I r] r u) 2 dxdtj i 


Here and in the proof < and <C just refer to X. 
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The crucial ingredient for Proposition |T| is the passage from measuring the 
Holder-a L 2 -modulus of continuity on scales 1 down to scales r. It is here 
that we need the deterministic ingredients of Propositions [6] and [3j Not 
surprisingly, we will need in this argument that solutions g to the stochastic 
linear constant coefficient parabolic equation, around which we perturb, have 
this localization property. This is provided by the following localized space- 
time supremum estimate of the Holder-a modulus of continuity of g. 

Lemma 5. For ao 6 [A, 1] let g(ao, •, •) be the solution of 

d t g - a 0 d 2 x g = f fort> -1, g(t = -1, •) = 0 

Then for any Holder exponent ct < | and all shifts h e M 

( sup sup V(g h ~g) 2 ) < min{|/i| 2a , 1}. 

' aoS[A,l] (t,i)6(-l,0)xl ' 1 

Here and in the proof < and <C refer to A and a. 


3 Proofs 

We start the string of proofs with Lemma [T] since it contains the other 
arguments in nuce. 

Proof of Lemma [1] We will establish the lemma in the stronger version 
where instead of D 2 (u, 1), we control the Gaussian moments of E 2 (u, 1) := 
f° 1 f rju 2 dxdt > D 2 {u , 1): 


(exp(i£ 2 (x))}<l. 

By concentration of measure, cf. beginning of Section [2j this is a consequence 
of the bound on the expectation 

<£ 2 (M)><1 (23) 

and the uniform bound on the carre-du-champs of the Malliavin derivative 

|VE(x,1)| 2 <1. (24) 

In order gain confidence in this principle of concentration of measure, let us 
relate it to the discrete case, that is, the case of countably many independent 
normal Gaussian random variables, see for instance m p. 135] for a proof of 
concentration of measure by an efficient and short semi-group argument. In 
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order to make the connection, let us divide space-time into squares Q of side- 
length h (no parabolic scaling needed here), which we think of being small. 
Assume that we are dealing with a function F of the space-time white noise 
£ that depends on £ only through the average of £ on the cubes Q ; which 
amounts to saying that F only depends on {£q}q, where £q := j- f £ dxdt (any 
reasonable function F can be approximated by such functions F h for h J, 0). 
The reason for using this normalization by the square-root of the space-time 
volume h 2 is that the application £ H > {£q}q pushes the space-time white- 
noise ensemble (•) into the normal Gaussian ensemble (-)h- In particular 
(F) = ( F)h and (exp(^F 2 )) = (exp (^F 2 )) h . Hence by the discrete the¬ 
ory, we have concentration of measure provided we have a uniform bound 

on the squared Euclidean (rather Hilbertian) norm |Vh-F| 2 := JOq(J^) 2 of 
the (infinite-dimensional) vector of partial derivatives. Therefore it remains 
to argue that \VhF\ : is dominated by the carre-du-champs |VF 2 of the 
continuum Malliavin derivative. By definition (1T2]) of the latter we have 

lim ±(F(£ + e5£) - F(£)) < |VF| ( / {btfdxdt) 5 (25) 

for any held <5£, hence in particular for a held 5£ which is piecewise con¬ 

stant on the cubes. More precisely, we may assume that 5£ is of the form 
<5 £|q = |A£q for some {(I£q}q so that j- t f Q 5t;dxdt = 5 £q. Because of this 
normalization, the 1. h. s. of (125]) turns into Jby dehnition of the 

partial derivatives, whereas the r. h. s. turns into so ^ ia I by I^ ie 

arbitrariness of {^£q}q, (1251) indeed implies |V^F| < |VF| (in fact, there is 
equality). 

We start with the hrst half of the proof, that is, the bound (123]) on the 
expectation. In fact, we shall establish that 

(E 2 (u,R))< 1, 

provided the scale R ~ 1 is sufficiently large (larger than a constant only 
depending on A). This indeed implies (123]) since by dehnition of the average 
I—R 2 J t]r ■ dxdt, E 2 {u, 1) < R 3 E 2 (u, R), where the power three represents 
the parabolic dimension. By the scale invariance (H]) fc (15]) , we might as well 
show 

(E 2 (u, 1))<1, (26) 

provided the massive term is sufficiently strong, that is, T ~ 1 is sufficiently 
small. In fact, it will be convenient for the upcoming calculation to replace 
the exponential cut-off Tj by fj 2 , where fj is a smoothened version of rj 2 , to hx 
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ideas 


fj(x) := exp(--Vx 2 + 1). 


(27) 


In order to establish (j26j) . we will use a martingale argument based on the 
stochastic (partial) differential equation with (nonlinear) damping 


9tU = -(-ll + (-dl) ir(u)) + 


(28) 


As is constitutive for a martingale argument, we shall monitor a symmet¬ 
ric and semi-definite expression, in our case f fju( 1 — dD^fjudx, where we 
use physicist’s notation in the sense that an operator, here (1 — d 2 )~ l , acts 
on everything to its right, here the product fju. This quadratic expression, 
which amounts to a version of the if -1 -norm that is localized (thanks to 
the inclusion of i)) and endowed with an infra-red cut-off (the effect of the 1 
in (1 — d is motivated by the H^ 1 contraction principle, a well-known 
property of the deterministic versions of (JTj) ; in this language, we monitor 
here the (modified) H _1 distance to the trivial solution u = 0. In general 
terms, the time derivative of such quadratic expression under a stochastic 
equation comes in three contributions: the contribution solely of the deter¬ 
ministic r. h. s. of (I28[) . the contribution solely from the stochastic r. h. s. 
£, and a mixed contribution. In this set-up, the space-time white noise £ is 
viewed as a white noise in time with a spatial (and thus infinite-dimensional) 
covariance structure expressing white noise in space. The mixed contribution 
is a martingale, and thus vanishes when taking the expectation: This cancel¬ 
lation can best be understood when considering a time discretization of (I28p 
that is explicit in the drift —(yu + {—d 2 )i r(w)) (of course, an explicit time 
discretization is not well-posed for an infinite dimensional dynamical system 
coming from a parabolic equation, so one better combines it in one’s mind 
with a spatial discretization). The contribution which solely comes from £ 
is the so-called quadratic variation, and its expectation can be computed 
based on the operator defining the quadratic expression, here fj( 1 — (9)!) -1 ?), 
and the spatial covariance structure of the noise (provided it is white in 
time). Since the spatial covariance structure is the one coming from (spa¬ 
tial) white noise, it is given by the integral of the diagonal of the kernel (i. e. 
the trace-norm of the operator). In case of fj( 1 — d 2 )~ l ri-> the kernel is given 
by rj(x)\ exp(— \x— y\)fj(y). Hence the expectation of the quadratic variation 
is given by f \rj 2 dx. Altogether, the martingale argument thus yields 


d 1 
dt 2 


rju(l 


d 2 ) 1 rjudx 


rju(l-dl) 1 (^rju + fj(-dl)n(u))dx^ + ^ 


-ij 2 dx. 


(29) 
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We rewrite this identity as 


d t , 
A eXp( T' 


= exp(-) 


fju( 1 — d 2 ) fjudx) 


fju(l-d%) 1 (^rju + 2rj(-dl)n(u))dx^ + / ^rfdx), 


and integrate over t G (—oo,0): 


( f exp(^) f fju( l-d 2 x ) 1 (^?]u + 2fj(-d 2 x )n{uj)dxdtj < ^ /' rfch. 


Hence in order to arrive at (1261) . it is enough to show that for T <C 1, we 
have the deterministic estimate 


ijufl-d;) '(-r/u + 2//I — y j7r('// )}rh: > / ifirdl. 


(30) 


We have a closer look at the elliptic term r/(—^)7r(w) in (l30lh whose contri¬ 
bution would be positive by the monotonicity of tt if it weren’t for the spatial 
cut-off and the infra-red cut off. Using Leibniz’ rule, we rewrite it as (in our 
physicist’s way of omitting parentheses) 

v(-dl)n( u ) = (! - d xM u )v + 2<9 x t T{u)d x rj - v{u){rj - d 2 x fj), (31) 

where we w. 1. o. g. assume that 7r(0) = 0. Hence by the symmetry of 
(1 — <9 2 ) -1 we obtain 


fju(l - d x ) fj(-d x )n(u)dx 

= / fj 2 uir(u)dx — 2 (d x fj)Tr(u)d x (l — d 2 )~ l fjudx 


~ (fj- d x f)Mu)(l - d x ) fjudx 


Using that the operators d x (l — d 2 ) a and (1 — <9 2 ) a have operator norm 1 
w. r. t. to L 2 , we deduce the inequality 


r,u(l- a lr^(-8lMu)dx > j r,\Au)dx 

2( [(d x fj) 2 TT 2 (u)dx ) 2 + ( f\fj - dlfj) 2 TT 2 (u)dx) 




( 32 ) 
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By the monotonicity properties (J2j) of tt and our gratuitous assumption 
7r(0) = 0, this yields 


fju(l-d%) l fi{-d 2 x )'K{u)dx 
> A f fj 2 u 2 dx — (%( [ ( d x fj) 2 u 2 dx)‘ 


( r] — d 2 rf) 2 u 2 dx ) : 


x rju(l — d 2 ) l rjudx^j . 


Our smoothing out of the exponential cut-off function, cf. (|27jl . has the sole 
purpose of making sure that 

\dxV\ + \dlv\ % V, (33) 

so that we obtain by Young’s inequality for the elliptic term, 

J fju( 1 — d 2 )~ 1 fi{—d 2 )'n{u)dx > J fj 2 u 2 dx — C J fju(l — d 2 )~ l fiudx. 

We thus see that thanks to the massive term, (15UD holds for T < 1. 

We now turn to the second half of the proof, the estimate of the carre-du- 
champs (l24j) . We first argue that (l24j) follows from the deterministic estimate 


E 2 (5u, 1) < J (5£) 2 dxdt, 

where 5u and are related via 

5u + d t 5u — d 2 (a5u ) = <5£ 


(34) 


(35) 


with a = Indeed, we note that by duality w. r. t. to the inner product 

(. 9,f ) ^ jliJvafdxdt, 


E(u, 1) = sup s E(u,f) : = 


r/ufdxdt 


i-i 


r]f 2 dxdt = 1, supp/ C (—1,0) x 


By the chain rule for the Malliavin derivative we thus obtain 


( 36 ) 


|V£(-,l)|<sup|V£(-,/)|, 

/ 
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where the supremum runs over the set implicitly defined in (I36p . so that it 
is enough to show for a fixed / 

|vb(«,/)I 2 <i- 


By definition of the carre-du-champs of the Malliavin derivative in case of 
the linear functional u i—>■ E(u, /), cf. (TT2j) . this amounts to showing 

J J r/Sufdxdt < 1, 

where the infinitesimal perturbation Su of the solution is related to the in¬ 
finitesimal perturbation of the noise via (|35]) . By the characterizing prop¬ 
erties of the /’s, cf. (1361) . this estimate in turn amounts to establishing (l34|h 

We now turn to the proof of the deterministic estimate (1341) . To ease notation 
and make the connection to Proposition [3l we rephrase (and strengthen) the 
goal: For w and / related via 

w + d t w — d 2 (aw) = /, (37) 

with uniformly elliptic coefficient field a in the sense of (fl4l) . we seek the 
estimate 

/ j w 2 dxdt < / / f 2 dxdt. (38) 


Like for (126|) . our Ansatz for (138|) is motivated by the 77" 1 -contraction prin¬ 
ciple. Again, we consider a version of // '-norm with ultra-red cut-off, but 
this time without cut-off function r], namely J w(L~ 2 — d 2 )~ 1 wdx , where the 
length scale L for the ultra-red cut-off will be chosen later. We obtain from 
the equation 




w(L — cr) (w — f + (—d 2 )(aw))dx 


= — J w(L — d 2 ) 1 (w — f — L ~aw)dx — J aw 2 dx. 

We apply Cauchy-Schwarz’ inequality and use the uniform ellipticity of a, cf. 
m, to obtain the estimate 


m J ™( L - 2 - atr'wdx 

w(L~ 2 — d 2 )~ l wdx — A 
f{(L~ 2 -d 2 x )- 1 w) 2 dx 


< 


w 2 dx 

( [ fdx)~ 2 +L~ 2 ( 


w 2 dx) 
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Thanks to the operator inequality (L 2 — <9 2 ) 1 < L(L 2 — <9 2 ) 2 we have 



so that we may absorb the term (f((L 2 — d 2 ) 1 w) 2 dx)^L 2 (J w 2 dx)z by 
Young’s inequality for L 3 > 1, obtaining 



Integration in time yields (138|) . 

Proof of Proposition [31 We first note that (ITS]) follow easily from (fTTlk 
by translation invariance (1171) also holds with 77 replaced by the shift g y , 
summation over 1 / 6 Z gives (fT 8 l) . We next note that w. 1. o. g. we may assume 
h = 0, since we may rewrite (TT5ji as d t (w — h) — d 2 (a(w — h)) = d 2 (g + ah). 
In this form (1I7|) follows from (TT 6 |) which we will proceed to show now. The 
proof of this proposition is very close to the deterministic part of the proof 
of Lemma HI in fact, it might be seen as an infinitesimal version of it. Like 
there, we substitute 77 by fj 2 , cf. (127)1 . and start from monitoring the localized 
iL -1 -norm of w with infra-red cut-off: 

~ J f)w(l-dl)~ 1 fjwdx = - I d 2 x )~ l f]{-d 2 x ){aw + g)dx. 

As in (HTTf , we write 

r)(-d 2 x )(aw + g) = (1 - d 2 x )(aw + g)fj + 2d x (aw + g)d x f) - (aw + g)(fj - dig), 
which yields 



Using symmetry and boundedness properties of (1 —9 2 )" 1 , and the estimates 
(j33j) on our mollified exponential cut-off fj, the two last terms are estimated 
as 
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and 


< 

r^j 


fjw(l — d 2 ) 1 (aw + g)(fj — dlfj)dx 
^ / fjw(l — dD^fjwdx / fj 2 (aw + g) 2 dx 


Hence we obtain by the uniform ellipticity (fl4|i of a together with the triangle 
inequality to break up aw + g and Young’s inequality 

1 


— j gw( 1 — d 2 ) l gwdx + — J r] 2 w 2 dx 
< t)w{1 — d 2 )~ l rjwdx + J fj 2 g 2 dx S j, 


which we rewrite as 

d /» 

— exp(— Ct) / f)w(l — dl^fjwdx 

+ — exp(— Ct) J fi 2 w 2 dx < C exp(—Ct) J 7) 2 g 2 dx. 
The desired estimate (TT6|) follows. 


Proof of Proposition [H We will again work with a smooth version rj of 
the cutoff yjfj, setting 

V(x) = ^exp ( - ~\/M 2 + !) Vr(x) = fj(x/r). 

Note in particular that this time the cut-off rj r at scale r is not normalised 
to preserve the L 1 norm. We will establish (TT9|) in the form 



fj 2 v 2 dxdt < r 1+2a ° j 1 — d 2 ) 1 (fjv)dx\ t= _ 1 . 


(39) 


for some cko > 0. 

We start by defining the auxiliary function V = — d x [l — d 2 )~ 1 (fjv), which 
we think of as a localised version of the anti-derivative of v. We claim that 


d t V-d x (ad x V) = d x g + f (40) 

where 


g : = a(l - d 2 ) 1 (fjv), 

f '■= 2^(1 - dly 1 {a{d x f})v) - d x (l - d 2 x )~\a(fi + d 2 x fj)v). 
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( 41 ) 


To see (1101) we first observe that 

9tV = -d x (l - d 2 x )-\f]dtv) = -d x {l - <9*) _1 {rjd 2 x (av)). 

Then we write 

fjdl(av) = dl(rjav) - 2(d x fj)d x (av) - ( d 2 fj)av 
= dl(fjav) - 2d x ((d x fj)av) + ( dlfj)av. 

Plugging this into dll]) we get 

d t V = -d x {l - d 2 x )~ l dl(afjv) - 2 d x (a(d x fj)v) + a(d%.fj)v 

= d x (afjv) - d x (l - dlY 1 afjv - 2 d x (a(d x fj)v) + a(d%f])v 
= d x (afjv) + f. (42) 

On the other hand we have 

d x V = -d 2 (l - d 2 x )- 1 {fjv) =ijv-(l- dly^fjv), (43) 

which together with d!2j) implies that 

d t V = d x (ad x V) + d x (a( 1 - ^) _1 (? jv)) + /. 

So (HOI) follows. 

Our next step is to derive a suitable version of Caccioppoli’s estimate on 
scale r -C 1 for (HOD . We can write for any c e M 

d t j nl\( v ~ c f dx 

= - [ d x (fj 2 (V-c )) ad x V + g dx + J rj 2 (V - c)f dx 

2(d x fjr)fjr(V - c) + f] 2 r d x v) ad x V + g dx + f fj 2 (V - c)f dx. 

(44) 

We treat the terms on the right hand side of this expression one by one. First 
we get 

2 (d x fj r )fj r (V - c) + rj 2 d x V)ad x V dx 


<-A / g r {d x V) dx + 2i / (d x fj r ) (V - c) dx) / f} r (d x V) dx 


< 


C 


fj 2 (d x V) 2 dx + ^ [ fj 2 (V — c) 2 dx, 


(45) 
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where in the last line we have absorbed the second term involving d x V in the 
first one and used the point-wise estimate \d x fj r \ < -fj r . For the second term 
on the right hand side of (1441) we get 


- J (f(d x fj r )fj r (V - c)+ fj^d x V)g dx (46) 

< Q2 J V 2 (V - cfdx^j 2 ( J fj 2 r g 2 da;) 2 + ( y rf[d x Vfdx ) 2 ( J f r g 2 dx') 2 . 


The integral involving (V — c) and the integral involving can be absorbed 
into the terms on the right hand side of (1451) . The only contribution that is 
left is the integral J fj 2 g 2 dx. Finally, for the last term on the right hand side 
of (ITT]) we get 

J fj 2 (V - c)f dx < (supply - c|) J fjr\f\ dx. 

Summarising, we obtain 

j f j - c f dx + y J fj 2 (d x V) 2 dxdt 

~ “2 y Vr( v ~ c ) 2 dx + y fj 2 g 2 dx+(^supf) r \V-c\^ j fj r \f\dx. (47) 


As a next step we integrate this estimate in time. To this end, let C; K. —K. 
be non-negative, non-decreasing such that £ = 0 on (— 00 ,—2), £ = 1 on 
(—l,oo) and with (' < 2 on M. We set ( r (t) = ft/r 2 ). Then integrating 
(ED against ( r we get 


fj 2 (d x V) 2 dxdt 


SS ^\ (v - c)2 ixit+ L., 

0 p pO p 

/ f(V — cfdxdt + / / gig 2 dxdt 

J- 2r 2 J J- 2r 2 J 


< — 

r\_/ ^2 


+ ( sup fj r \V — c\ J 

' (-2r 2 ,0)x» ' 


1-2 r 2 


fj r \f\ dxdt, 


(48) 


where we have absorbed the term J_° f(dt( r )vl |(V — c) 2 dxdt in the first 
term on the right hand side. We proceed by bounding the first term on the 
right hand side 

— [ [rfAV-c) 2 dxdt < ( sup gJV — c) 2 ]— [ [ fjrdxdt 

rZ J-2r 2 J ' (—2r 2 ,0)xR ‘ r " J-2r 2 J 

< r sup fj r (V - cf, 

(—2r 2 ,0)xM 
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the second term by 



g 2 g 2 dxdt < r 3 sup g 2 

(—l,0)xR 


and the last term by 



Inserting these estimates in (l48]i we arrive at 



r]r(9 x V) 2 dxdt 


< r sup g r [V ~ c) 2 + r 3 sup q 2 + r 2 

(—2r 2 ,0)xR (—l,0)xR 



/ 2 dxdt, 


(49) 


where we have dropped the non-negative term f f] 2 \(V — c) 2 dx | t=0 on the 
left hand side. 

In the following crucial step we will use the De Giorgi-Nash Theorem to 
obtain a slightly larger power of r in the first term on the right hand side of 
039]). More precisely, we will use the estimate 


VL 0 , H ,„,<(/_' v 2 dxdtf + 

+ sup \g\, 

(—l,0)x(—1,1) 




1 

2 


where \V] ao denotes the parabolic «o Hblder norm of V (defined as in (fill) ). 
We refer to Theorem 8.1 and Theorem 10.1 in [TO]; Theorem 8.1 gives local 
control of sup t 3 , \V\ in terms of (f V 2 dxdt )^, and Theorem 10.1 gives local 
control of \V] ao in terms of sup t3 , |V|. Our control of the right hand side g 
and / in (ITU]1 through the local norms sup 4 x \g\ and (f f 2 dxdt)^ is well within 
the allowed range, cf. (7.1) & (7.2) in [TU], which in one space dimension can 
deal with control of f f (\g\ 2 + f) 2 dxdt (n — 1, q — r — 2, and k — \ in the 
notation of this reference). 
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We assemble the last estimate to 

< 


Sirp[V] ( —1 o)x(fc-|,fc+i) 
fcez 


pO p oo . 1 

' J V 2 dxdtj 2 + 
+ sup |p|. 

(—1,0) x (— 00 , 00 ) 


/■0 /"oo 
'-1 J-00 


f 2 dxdt] 


Going back to (TFJD we choose c = V( 0, 0) and write 


sup rj r (V(t,x) — V( 0, 0)) 2 

(t,a;)e(-2r 2 ,0)xK 


< 


H*l 


£■ 

fee z 

<r 2 “° sup[h] a(Ji( _i 0)x( j._i j.,).!). 


sup 

(t,x)£(— 2r 2 ,0)x [r(fc—U,r(fc-|-|)] 


(U(f,x)-U(0,0)) 5 


fcez 


Combining these estimates we obtain the following inverse Holder inequality 
[ [ fj 2 (d x V) 2 dxdt 


< ^,l+2oo 


c. 


u 2 + 


'-1 


f 2 dxdt + sup q 
(-1,0) xR 


(50) 


valid for r -C 1. 


It remains to control the functions / and g. For / we have by definition 

J j f 2 dxdt < J J (d 2 x (l - d 2 x y\a(d x fj)v)^ dxdt 

+ J J (< 9 X (1 - dly l (f) + dlfj)(av) S j dxdt. 

Using the boundedness of <9 2 (1 — d 2 )^ 1 on L 2 , as well as |a| < 1 and \d x fj\ < 
fj we can see that the first integral is bounded by J ( fjv) 2 dx . Using the 
boundedness of d x (l — d 2 ) 1 on L 2 and the point-wise bound \fj + d 2 fi\ < fj 
we bound the second integral by the same quantity. For g we write for any 
t using the embedding H l <—>• L°° 

sup.9 2 < sup ( (1 - d 2 x y\fjv)) 2 

xgR x 

< J (d x {l -d%y 1 (f}v)) 2 dx + J ((1 - d 2 y\fjv)) 2 dx 

= Ji 1 ~ d l)~\vv) ( - d 2 (l - d 2 x y\fjv)) dx + J ((1 - d 2 x y\fjv)) 2 dx 

< J(fjv)(l-8%y 1 (fjv)dx, 
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where in the last step we have used the boundedness of (1 — d 2 ) 


1 

2 


on L 2 . 


Summarising these bounds and plugging in the identity (143]) which expresses 
d x V as fjv and a higher order term we obtain 



fj 2 (fjv) 2 dxdt 


< ^l+2«o 





+ sup 

t€(—1,0) 


(fjv)(l — d x ) (fjv) dx 


+ 



(■ gv) 2 dxdt 

b/, 2 fll - dl)- 1 (r t v)fdxdt. 

(51) 


The first term on the right hand side is bounded by the second term due to 
the boundedness of d x (l — d 2 )~ v on L 2 . For the last term on the right hand 
side we can write, using once more the embedding H 1 <—>• L°° 

fj 2 (( 1 - d 2 )- 1 (fjv)ydxdt < r 3 sup f (fjv)(l - d 2 x )~ 1 (fjv) dx , 

te(-i,o) J 



so this term can be absorbed in the second term on the right hand side of 
(I5T|) . In this form the estimate trivially holds r which are bounded away 
from 0, so that we can conclude that it holds for r < 1. Then the desired 
estimate (1391) follows from estimate (1I6|) in Proposition [3j We finally note 
that we may replace the cut-off fj by y/fj because the kernel of (1 — d 2 )~ l is 
non-negative. 


Proof of Proposition [D For conciseness, we ignore the massive term in 
©• The main object of this proposition is 5u := u h — u, where u h (t,x) = 
u(t,x + h) denotes a spatial shift of the stationary solution of (JU). We note 
that 5u satisfies the formally linear equation 


d t 5u - d 2 (a h 5u) = (d t - a 0 d 2 x )5g, (52) 

where we introduced the coefficient held 

ah = [ ir'(cru h + (1 — cr)u)dc r, (53) 


which by (J2]) is uniformly elliptic in the sense of (fT4]h and we have set Sg : = 
g h — g, where g is defined via the linear version of (JTJ) 

d t g — aad 2 g — for t e (—1, 0), g — 0 for t — —1, (54) 


24 




cf. Lemma El with a constant coefficient a 0 G [A, 1] to be chosen below. 


We start with the main deterministic ingredient for Proposition [IJ which we 
need to go from scales of order one to scales of order r <C 1 in an L 2 -averaged 
Holder modulus of continuity, ft is given by the estimate 



(55) 


for a 0 > 0 from Proposition El which we shall establish for all r < 1. 

To this purpose, we split the solution 5u = Sg + v + w, where v is defined 
through the initial value problem with homogeneous right hand side 


d t v — dl(ah,v) = 0 for t G (—1, 0), v = 5u for t = —1, (56) 

and where w is defined through the initial value problem with homogenous 


initial data 

d t w — dl(cihiu) = dl((a>h — ao)5g) forte (—1,0), w = 0 for f = — 1. (57) 

Taking the sum of (l54jh (1561) and (1571) . and comparing with (l52]h we see that 
this indeed gives Su = 5g + v + w. 

We first address v. From the estimate (TTU]1 in Proposition SJ we learn that 
there exists an ckq > 0 (depending only on A) such that 



(58) 


By Leibniz’ rule an its discrete form of 

r] 2 <)u = rj 2 5(u — c) = 5(r) 2 (u — c)) — 5rj 2 {u h - c) 
= S(r] 2 (u - c)) - (6r)z h (u - c)) h 


and the triangle inequality we can write the integral on the right hand side 
of (158]) as 
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where we have set V (1 — d 2 ) 2 772(77 — 0), and c can be chosen arbitrarily. 
We use the point-wise bound \Sr] 2 h \ < \h\e~ 77 2 , the point-wise bound g 2 < 77 
as well as the boundedness of (1 — < 9 2 ) _2 on L 2 to bound the second term as 
follows 

J ((1 — dl)~^5r]2 h (u — c)) 2 dx < h 2 e^ J g{u — c) 2 dx. 

For the first term we get 

J ( 5V) 2 dx < h 2 j ( d x V) 2 dx = h 2 j (d x (l — d 2 )~^g 2 {u — c)) 2 dx 

< h 2 J rj(u - cfdx , 

by the boundedness on L 2 of d x (l — d 2 )~ l . Summarising these bounds and 
chosing c = [ r/udx\ t=1 we get 


V r v 


2 dxdt < r 2a ° [—) 2 e^D , 2 (u, 1). 


We now turn to w. Applying the first part (fTTj) of Proposition [ 3 ] (with 77 
replaced by 771), to ( 157 j) we gather that 


ro r r° 

^ I /„ „ \ 2 / 


r)iw 2 dxdt < 


771(0/,. — ao) 2 ( 5 g) 2 dxdt , 


'-1 


'-1 


which implies for r < | (by the obvious inequality J g r ■ dxdt < (-^) 3 x 
J rjji ■ dxdt for r < R and since 771 < rj 2 ) 

r 3 4 / r] r w 2 dxdt < / / 77(0/, — a 0 ) 2 dxdt sup g( 5 g) 2 . 

J —r 2 J J— 1 t/ (t,x)E(—l,0)xR 

Finally, because of j ^ r2 J ^dxdt < 1 for r -C 1 , we have for the last contri¬ 
bution (fg 


/ g r (dg) 2 dxdt < sup t?^#) 2 . 

; J (t,i)e(-i,o)xR 

Combining the four last estimates yields ( 1531 ) for hu = < 5 g + v + w. 

We now post-process ( 1551 ) and to that purpose make the choice of «o = 7 r'(c) 
with c := f gudxdt , so that in view of the definition ( 1531 ) of a/, and the 

Lipschitz continuity (j 3 J) of 7r' 


\a h - a 0 | < \a h - ti'{u)\ + |7r'(w) - 7r'(c)| < L(\ 5 u\ + |u — c|). 


( 59 ) 
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Therefore, (after replacing r by 2r in order to make 77 2r appear, which is no 
problem thanks to r <C 1 ) (15 5 p turns into 



(60) 


<r 2 “»(-)V ,| D' 2 (ii,l) 

/ L 2 /"° f L 2 \ 

+ (H —t / v( uh ~ u) 2 dxdt H— -D 2 (u, 1)) sup r](g h 

' H J -1 y r A J (t,a;)G(—l,0)xR 


fl 1 ) 2 - 


Now we integrate in h according to J rj 2 r{h) ■ dh. As we shall argue below, 
we have for the 1 . h. s. of (1601) 

J V 2 r(h) -j- J rj 2 r(u h >-u) 2 dxdtdh > j J rj r (u — J ij r u) 2 dxdt. (61) 

For the first term on the right hand side of (16U|) we observe that for r < i 
[ il 2 r{h)r 2ao ) 2 e^D' 2 (u, 1 )dh < r 2a °D' 2 (u , 1). 


The second term on the r. h. s. of (16011 comes in form of a product of two 
h-dependent functions we momentarily call f\(h) and / 2 (h). To this purpose 
we use that thanks to 4r < 1 we have r/ 2 r < rjrj 4 r for our exponential cut-off 
so that J ri 2 rfif 2 dh < swp h (r}fi) J rj 4:r f 2 dh. We claim that for the first factor 
in the second term on the right hand side of (1601) we have 


sup 77 (h) 
h 



rj{u h — u) 2 dxdt 


< 

rs.7 


'-1 


77(77 — J J r]u) 2 dxdt = D(u, 1 ). 

(62) 


Before inserting them, we give the easy arguments for (16TT) and (E2D: By 
scaling we may assume r = 1 so that (16TT) follows from Jensen’s inequality in 
form of 


77(77 — / r]u) 2 dx < 



r]{x)r]{x + h)(u h (x) — u{x)) 2 dxdh 


and the fact that for our exponential cut-off r}(x)r}(x + h) = |exp(—(|x| + 
\x + h\)) < |exp(— \(\h\ + |x|)) = Arj 2 {h)r] 2 (x). For (RxZD . by the triangle 
inequality in L 2 , it is enough to show for a constant c (f° 1 J r/udxdt in our 
case) 


sup 77 (h) 
h 


r)(u h — c) 2 dx < / 77(77 — c) 2 dx. 


(63) 
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This inequality follows from writing 


sup r)(h) / g(u h — c) 2 dx = sup / g{h)g{x — h)(u(x) — c) 2 dx 

h J h J 

and the fact that for our exponential cut-off g{h)g{x — h) < rj(x). Inserting 
dSH and (jS2D into (jfiOl) we obtain 

-j- J rj r (u — J r} r u) 2 dxdt < r 2a °D' 2 (u,l) 

(l + ~J d2 (u, 1)) [ V4r(h) sup r](g h — g) 2 dh. (64) 
^ T ' J (t,®)G(-l,0)xR 


Before taking the (restricted) expectation of this inequality, we note that our 
choice of c = f° 1 J gudxdt depends on u and so does our choice of coefficient 
a 0 = vr'(c). Therefore g has to be viewed as the solution of a stochastic 
heat equation with constant but random , non-adapted coefficients and the 
standard regularity estimates do not apply immediately. This problem is 
addressed in Lemma [5] where a bound on (sup aog [ A)1 ] sup(_ 1)0 ) xR r}(g h — g) 2 )i 
is provided. So when taking the (restricted) expectation of (the square root 
of) (16-0) and inserting this estimate provided by Lemma ED we obtain 


r) r (u — / g r u) 2 dxdtj ^ 


< r a °[l + D’(u,l) + —(D 2 (u,l))! ). 

7*2 


(65) 


We now appeal to the triangle inequality in form of 


D(u,r ) < 


rj r U 


Or .1 

2 ,A 2 


rj r u) 2 dt 


rj r (u — / g r u) 2 dxdt 


( 66 ) 


and Lemma [4] for the upgrade to 

(D(u,r)) i < + r ao (l + D'(u, l) + ^(D 2 (u, l))f), 

which we rewrite as 

{DM)! < r ao (l + D\u,l) + ^{D 2 (u,l))l). (67) 

\ 7 * 2 / 
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In this form, we see that (KJ7I) does not just hold for r C 1 but trivially for 
r < 1 with r ~ 1, since D(u,r) < dj D(u , 1). It remains to appeal to Lemma 


El 


Proof of Proposition [2j For conciseness, we ignore the massive term in 
(ITT) and fix r < 1. Following the argument in the proof of Lemma (H we first 
claim that the proposition reduces to the following deterministic estimate 

D 2 (Su,r ) < — j (5^) 2 dxdt ( 68 ) 

for any decaying Su and 5£ supported for £ 6 (—1, 0) related via 

d t 5u — d 2 (a5u ) = 5£, (69) 


where a := satisfies m- Indeed, we note that by duality w. r. t. to 

the inner product (g, f ) i->- 2 J ri r gfdxdt, 


D(u,r ) = sup | D(u, / rjrufdxdt 


(70) 


J rj r f 2 dxdt = 1 , supp/ C (—r 2 ,0) x M, j- J r) r fdxdt = 0 
By the chain rule for the Malliavin derivative we thus obtain 


\X7D(u,r)\i < sup \\7D(u, f)\i, 
f 

where the supremum runs over the set implicitly defined in f|70p . so that it 
is enough to show for a fixed / 

ivz?(«,/)|f < 

By definition (fT2|) of the carre-du-champs of the Malliavin derivative applied 
to the linear functional u e-)■ D(u, /), this amounts to show 

■j- J r] r 5ufdxdt < — J (6C,) 2 dxdt S j 2 , 

where the infinitesimal perturbation Su of the solution is related to the in¬ 
finitesimal perturbation S £ of the noise supported on (—1,0) x M via (jffUl) . 
By the characterizing properties of the /’s, cf. (1701) . this estimate in turn 
amounts to f[68lh 
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In order to see (1551) we use the trivial estimate 

D 2 (6u,r ) < — J j du(t,x) 2 dxdt 

and apply Proposition [3] for h = 5^d.s. Observing that is a bounded 

operator on L 2 (— 1 , 0 ) we obtain ([ 68 ]) . 


PROOF OF Theorem CD In this proof < and <C refer to constants only 
depending on A and eventually on L and a. For some 9 G (0,1) to be chosen 
later we consider the random variable D(u,9). By Proposition [2] we know 
that D(u, 6) is a Lipschitz variable with respect to perturbations of the noise 
which are supported in (— 1 , 0 ) xR and we have 


By concentration of measure, cf. the beginning of Section [21 applied to the 
restricted ensemble (-)i we conclude that suitably rescaled fluctuations 

x = ei(D(u,e)-(D(u,e)) 1 ) 

satisfy (exp(A ; y ; 2 )) 1 < 2 , and thus a fortiori (exp^y 2 )) < 2 . Combining this 
with Proposition Q] we get the almost-sure inequality 


1 

9 a ° 


D(u,9)<l + D'(u,l) + 


Ts 1 ) + 1 ) + 

9 2 


1 

9§+ a ° X 


By the invariance in law under the scaling (p[|) &; (]5|) Sz (161) ., this yields for 
any length scale R < 1 


1 1 

0 ao Rt 


-D(u, 9R) <1-1 t D'(u, R) -| — 3 — (— yD'(u, R) + l) + 


R 2 


e~. 


6 >§+“° 


XRi 

(71) 


with an i?-dependent random variable \R °f Gaussian moments (exp^y^)) < 
1. Then using the fact that D' 2 (u, 1 )(u,R) = J t]r(u — f r]Ru) 2 dx\ t= ^R 2 sat¬ 
isfies 


R 


D\u , R!)dR! = 


R 


VR'( U - / VR' U ) dx\t=-R2 ) dR! 


< 


R 


R 


VR'{.u 


-R ■ 


m u ) dx\ t= - R 2 ) R'dR 


< 


—R 2 


Vr(u 


-R ■ 


r] R u) 2 dx) dt < D(u,R). (72) 
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we see that by replacing R by R' in (fTTjl and by averaging over R' 6 (j,R) 
we obtain 


— — r D('U, 9R) < 1 + —-D(u, R) + - 3 -(D(u, R) + Ri) + 

V 0 Ri R 2 02 


1 

Q^+ao 


x'r, 


where x'r := fn XR'dR' still has Gaussian moments (exp {■^x'r)) < 2, since 
the latter property is preserved by convex combination. In order to prepare 
for recursion we introduce an a < cx 0 an d rewrite the last estimate as 


my 


-D(u, 6 R) <(R 2 - a 6 ao ~ a + 


LR 


1—a 




) + (* 


\ao~a 


+ 


R?L 


^ 2 —( Q °— a ) R 


)—£>(«, i 2 ) 


R 2 - 


0 l+“ 


Xr- 


Thanks to a < «o we can first fix 6 <C 1 and then R 0 <C 1 so that this 
estimate turns into 


1 

( 1 9R) a 


D(u, OR) < 


1 1 
2 R a 


D(u, R) + CR^~ a x' R + C 


(73) 


for all R < Rq. This prompts to consider the random variable 

Xr q ■= max (0 n )m x r 

n= 0,1, 


which in view of (recall a < |) 


Xr„ < 

n =0 

= -j— x convex combination of {x'enR 0 }n= o,i,-- (74) 

1 — 02 

has Gaussian moments (exp (^X 2 )) < 2 since by construction, the random 
variables have a uniform Gaussian moment bounds. Thanks to the 

factor | < 1 the estimate (1731) can be iterated to yield 

sup 7 ^; D ( u mO n ) ^ D(u,Ro) +x+ 1, 

n=o,i,••• ) 

which implies (using once more D(u,r ) < (—)f D(u,R) for any scales r < R 
to bridge the dyadic gaps as well as the gap between R 0 and 1) 

sup —D(u, r) < D(u, 1) + x + 1. 

r<l r a 
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Theorem [T] now follows by invoking Lemma [T] and relabelling a as ck 0 - 
Proof of Proposition O We start by observing 


d 1 
dt 2 


rj(d x v) 2 dx = * / d x (r)d x v)(a 0 d$.v + f)dx 


r)(a 0 (d 2 v ) 2 + fd 2 v)dx - / d x r]d x v(a 0 d 2 v + f)dx, 


so that because of ao G [A, 1] and \d x r/\ < 77 we obtain by Young’s inequality 
~ f v(d x v) 2 dx < - / i 7 (A(^n) 2 + /^n)dx+ f rj\d x v\(\d 2 x v\ + \f\)dx 


< I r]{d 2 x v) 2 dx + C J r]((d x v ) 2 + f 2 )dx. 

Dropping the good r. h. s. term, we rewrite this as 
d 


-^(t + 1) / r)(d x v) 2 dx < 


d{{d x v ) 2 + f 2 )dx, 
so that we obtain from integration in t G (—1, 0) 


(75) 


sup (t + 1) / n(d„v) 2 dx < / / rf((d x vf + f 2 )dxdt. (76) 

te(— 1,0) J J - 1 J 

Thanks to the constant coefficients, also the (localized) L 2 -norm is well- 
behaved. Indeed, from (12Uj) we obtain 

~ I rjv 2 dx = J r]v{a 0 d 2 x v + f)dx 

= / r](-a 0 (d x v) 2 + vf)dx - a 0 / d x rjvd x vdx, 


so that because of ao G [A, 1] and \d x r]\ < r], we obtain by Young’s inequality 
j r]v 2 dx < j rj(d x v) 2 dx + J r](v 2 + f 2 )dx. 

From the integration in t of this differential inequality for J rjv 2 dx we learn 


sup / r]v 2 dx+ / 
te(-i,o) J J - 1 


v(d x v ) 2 dxdt 


< 


r]f 2 dxdt+ / r]v 2 dx\ t =-i. (77) 


'-i 
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The combination of this with (j75j) yields 


sup (( 1 +t) / r)(d x v ) 2 + / r\v 2 dx ) 


< 


te(-i,o) 


rjf 2 dxdt+ / rjv 2 dx\t=-i- (78) 


In view of this a priori estimate f|78|) . the desired estimate (12T| follows from 
the embedding 

sup (1 + t) 1 2 V v 2 < sup ((1 + t) [rj(d x v) 2 dx + fr)v 2 dx), 
(t,i)e(-i,o)xi te(- 1 , 0 ) J J 

which easily is is seen to hold: Because of 

supgv 2 < J \d x {j]v 2 )\dx < J (rj\vd x v\ + \d x rj\v 2 )dx < J ri(\d x v\ + \v\)\v\dx, 
we obtain by Young’s inequality for t G ( — 1,0) 

(t + 1 )^ sup r\v 2 < (1+t) J r]((d x v) 2 + v 2 )dx + J r]v 2 dx 
< (1 + t) rj(d x v) 2 dx + / r\v 2 dx. 


Proof of Proposition [5j As before we ignore the massive term in ([T]) 
and monitor 5u := u h — u, where u h (t, x) = u(t, x + h) denotes a spatial shift 
of the stationary solution of (P) . As in the proof of Proposition |T] Su satisfies 
the formally linear equation (I52[) . 

We will again derive a recursive estimate to go from scale one to scale r for 
the functions Su. This time it is given by the estimate 



r] r (Su) 2 dxdt < 


x 


1 

1+^7 


L 


r)(a,h — ao) 2 dxdt^ 

f r}(5u) 2 dxdt + sup rj(5g) 2 \( 79) 
1 J (t,®)G(-l,0)xR ' 


which we shall establish for all r 1. Compared to (f551) the term 

r 2 ao (^) 2 e w D' 2 (u,l) 


on the right hand side, which limits that Holder regularity to the strictly pos¬ 
itive but possibly small ao > 0 coming from Proposition [4] has disappeared. 
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The price to pay is the extra term f g(6u) 2 dxdt which will eventually 
lead to an estimate which is quadratic in u. 

We observe that it is enough to establish for any R £ [4,1] the estimate 



g r (5u) 2 dxdt 


< 


x 



(t + R 2 ) *r)(a,h — a 0 ) 2 dxdt' S j 

-b? + sup v($g) 2 ), 

(t,x)e(--R 2 ,0)xl 7 


since the integral of this estimate over R e [|, 1] yields ([55]) . using the in- 
tegrability of (t + R 2 )~^ thanks to | < 1. To simplify notation, we replace 
R ~ 1 by unity, so that it remains to show 



rj r {5u) 2 dxdt < ^1 H— - j" j (t + 1) 2 g(dh — a 0 ) 2 dxdt'j 


x ( / 7?(hu) 2 Gfc|t=-i + sup v(dg) 2 ). (80) 
' J (t,x)6(-l,0)xl 7 


As before we split the solution 5u = 5g + v + w, but this time v is defined 
through the constant-coefficient initial value problem 

d t v — a 0 d 2 v = 0 for t e (— 1 , 0 ), v = 5u for t — — 1 , 

and w is defined through the initial value problem 

d t w — d 2 (dhW ) = d 2 ((a,h — a 0 )(5g + v)) for t G (— 1 , 0), w = 0 for t = — 1 . 

(81) 

In view of the constant coefficient do, this time we can bound v using Propo¬ 
sition [ 6 ] rather than Proposition |4j More precisely, the first bound (l 2 T| of 
Proposition [ 6 ] implies that 

sup (1 + t)^gv 2 < /'r7(<5n) 2 Gte| t= _ 1 , (82) 

(t,a;)G(-l,0)xM J 

which implies in particular for r < | (which amounts to r < 4 before setting 
setting R = 1 above) 

J VrV 2 < J g(5u) 2 dx\ t= -i. (83) 
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To bound w we use the first part (HTTP of Proposition [3] (with 77 replaced by 
r/i) to gather that 



rji(a h - a 0 ) 2 ((5g) 2 + v 2 )dxdt, 


which implies for r < | (using the obvious inequality J^ r2 J g r ■ dxdt < 
(y ) 3 f° R 2 f t]r ■ dxdt for r < R and since r]i <rf) 


r 


3 



rj r w 2 dxdt 


(84) 


< 
r \j 



r)(a,h — a>o) 2 dxdt 


sup (1 + t)^r)((5g) 2 + v 2 ). 
(t,x)E(— 1,0) xR 


Inserting 


into 


yields 


77 r w 2 dxdt < — / (1 + t) 2 / 77 (ah — a 0 ) 2 dxdt 


'-i 


x ( / g(du) 2 dx\ t= ^i + sup g(Sg) 2 ) 
v J i,o)xR / 


Finally, as in Proposition Q] we have 

r-0 p 


g r (5g) 2 dxdt < sup rj(Sg) 2 . 
(t,x)£(— l,0)xR 


Combining the two last estimates with (1531) yields (j5U|) for Su = 5g + v + w. 

The post-processing of (1731) follows the same lines as the corresponding ar¬ 
gument in the proof of Proposition [1] and we only give a sketch. Making the 
choice of a 0 = 7 r'(c) with c := J gudxdt and using (1331) . the estimate (173]) 
turns into 



< 

r^j 


x 


J V 2 r{u h — u) 2 dxdt 

/ L 2 f L 2 \ 

yl + — J j g(8u) 2 dxdt + —D 2 (u,1)J 

/ g(6u) 2 dxdt + sup v{dg) 2 J 

J (t,x)G(-l,0)xR ' 



Then we integrate this estimat in h according to f r/ 2 r(h) ■dh, and use (16H) to 
compare the left hand side to j^ r2 J rj r (u — J g r u) 2 dxdt and ( 1621 ) to compare 
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the supremum of first factor on the right hand side weighted by r/(h) to 
1 + -^ 3 -D 2 (u, 1). fn this way we obtain 



g(5u) 2 dxdt + sup g(5g) 2 )dh. (85) 

J (t,x)€(— l,0)xR 



Then, we take the restricted expectation of the square root of (J85j) . As in the 
proof of Proposition [T] (keeping in mind the discussion about the dependence 
of g on the random coefficient ao), Lemma 0 allows us to control the term 
involving g to obtain a factor r a for any a < L t . For the term involving 8 u in 
the second factor we invoke Lemma [2] and obtain 





1 

2 


1 


< 


(l + ^j{D 2 (u, !))i 2 )(r 2 +r(D 2 (u, 1))? +r“). 


Then as in the proof of Proposition Q] we make use of the triangle inequal¬ 
ity (jfiijD and Lemma 0 to replace the term ^ ^ f^ r2 f r] r (u — f r] r u) 2 dxdt j 
on the left hand side by (D(u,r))i. In this way we finally obtain 


(D(u,r)) i 


< 


i 1 + 4-(^ 2 («, 1 ))i ) i 1 + r H D2 (u, 1))? 


which yields the desired estimate after appealing to Lemma [3J 


Proof of Theorem [21 In this proof < and <c refer to generic constants 
only depending on A, a, and eventually L. The first steps in this proof resem¬ 
ble the argument used in the proof of Theorem [TJ According to Proposition 
[Owe have 

i<D(u,e)) t < (1 + 4(1 + D'(u, 1))) (1 + oiD'(u, 1)) 

(7 (j 2 

and according to Proposition [2] 

|V£>(m)|i<H. 
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As in the proof of Theorem [TJ we apply the concentration of measure to the 
restricted ensemble (•)! and obtain the existence of a random variable x with 
Gaussian bounds, (exp(Wy 2 )) < 2 such that 

Td(„, 9) < (1 + 4(1 + D\u, 1)))(1 + eiD'(u, 1)) + J-x, 

v y 2 y 2 ' 

where we think of small 0 <C 1. We apply the invariance in law under the 
scaling (0]) & (EJ) & ©, to obtain for any length scale R 

~D{u, OR) < (l + + D’(u, R))) (1 + A D'(u , R)) + 4~Xr, 

U JX 2 0 2 XX 2 

( 86 ) 


with an R -dependent random variable xr of Gaussian moments (exp( ^Xr)) A 

2 . As in the proof of Theorem [T] we integrate out the initial time —R in this 

estimate using (172]) to get J-r D\u , R!)dR' < D{u , i?) and get 

17 2 


X T-B(«, OR) < (1 + + B(«, fl))) (1 + Al>(«. R))+R- 

UR 2 02 XT 2 02" 1 " 0 


02 


Xrj 


where := Jr Xr'^R still has Gaussian moments (exp(A\' H 2 )} < 2. 

At this point our argument deviates from the proof of Theorem [TJ because 
the bound on the right hand side is non-linear in D(u,R). We reformulate 
it as 


D(u,R) < 02 
R < e 3 


~D(u,0R)<C^ r D(u,R) + 

V R2 R2 U 2 


g^+a 


Xri 


where Xr ~ Xr + 1 still l ias Gaussian moments (expfWy;^ 2 )) < 2. Hence 
selecting 6 ~ 1 sufficiently small, we obtain 


D(u,R) < 6* 
R < 6 3 


1 11 R2~° 

—— D(u,9R ) < D(u, R) H- 5 — 

(0R) a ~ 2 R a y ’ 6)|+« 


Xr- 


(87) 


Since (IHT|) implies in particular D{u , OR) < ^tXr + \D(u, R), we see that in 

order to convert (1871) into a self-propelling iteration, we need R^Xr A ^0 3 , 
(0R)^x'eR — anc l so on - Now we can proceed as in the proof of Theorem Q] 
and consider the random variable 


Xr '■= max 

71=0,1,- 


(i 0 r 


^- a Xer 


R — 


max 

71=0,1,- 


(0 


<n\\ n 

2 XV 


Ri 
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which thanks to a! < \ can be written as a multiple of a convex combination 
°f Xe n R as in (TblD . and which therefore has Gaussian moments ( exp (prX 2 )) < 
2 itself. From (1871) we learn 


D(u, R)<6?, R<9 3 and R*xr < x# 3 


VneN 


(9 n R)' 


-D(u,6 n R ) < 


1 1 

- . D(u, 9 n ~ 1 R) + -3 — xr- 

2 (9 n ~ 1 R) a v 9i+« 


Thanks to the factor < 1 the last statement can be iterated to yield 


D(u, R)<9~2 , R<9 3 and R*xr < ~^ 


SU P m ® UR ) < \tT D ( u i + 2 ~3- 

„=o,i,- ( 9 n R) a v ’ - 2 R a v ' 0§+“ 


XRi 


which implies (using once more D(u,r ) < (—)a D(u,R) for any scales r < R 
to bridge the dyadic gaps) 


D{u,R)<9*, R<9 3 

1 


and 
1 


R 2 Xr< g 103 


sup— D(u,r) < ——- 
r<R.r a K J ~ 2i?° 




Summing up, we learned that for any length scale J? <C 1, we have for some 
constant Co = Co(X,a, L) whose value we want to momentarily remember 

D(u,R)< and R?Xr < 77 - => sup (—) D(u,p)< 1. (88) 
Go Go r<i? V r / 

We now apply the elementary inequality 

1 1 fR\ a 1 1 

SU P ~D(u,r) < — sup - D{u,r) + —— sup —D(u,r), 
r <l r a R a o < r <R \r J R a Q ° 0 < r < 1 r «o 


which is valid for any ao < a and which we will use for an exponent «o 
appearing in Theorem [0 As a value for R we choose R = 4 y where 


M : = inf j- D(u,r ) < — and r 2 y r < —| 
f T Go Go j 


< max 


C '° sup —£>(«, r ) j “°»inf {M > 0|x_^ < 


cT 


>}■ 
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With these choices (1S5|) turns into 


sup — D(u, r) <M a + M a ~ a ° sup — D(u, r) 

r< 1 r a 0<r<l r a ° 

<M”+( sup -t-D(«,r))*, 

V 0<r<l r a ° / 


( 89 ) 


where in the last step we have used Young’s inequality. We claim that the 
random variable M has stretched exponential moments of the form 


1 


(exp(-M 2a °)) < 2. 


(90) 


Indeed, on the one hand 


2a 0 

a Q 


e x p(i(c„supY B( „, r) j “”j<2 
by Theorem [H and on the other hand we have for any threshold /i 

(/(inf {M > 0: XU < > /<)) < (ife > (£))£«-*. 

due to the uniform Gaussian tails of the xi, which implies that 


/I r _ M 2 'i \ 

exp(-mf{M>0:x i <- S r}) <2. 


(91) 


Using (l90ji and (19TT) on (l89]i yields as desired 

It 1 

sup 

■0</<l 


( exp (^(„ s < u ,?,> ( “' r) ) 2 ")> 


< 2. 


Proof of Lemma [2j Since thanks to r C 1 we have f rj r (h)(e^ — 1 ) 2 dh < 
r 2 , it is enough to show for any shift h 


< 


j f(uh ~ 

\h\ + (e ^ — l) 2 ^ J J fj 2 (u — cfdxdt^j , 


(92) 
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where, as in Lemma [H for the upcoming calculations we have replaced the 
exponential cut-off 77 = rj 2 by its smooth version r ) 2 where 


fj(x) \= exp(—-v/x 2 + 1 ) ~ t] 2 (x) 


(93) 


and we have set for abbreviation c := / fj 2 udxdt. By the martingale 

argument based on the stochastic differential equation 


d t {u h - u) = -(-d 2 x ){ir(u h ) - tt(u)) + (£ h - £) 


we have 


d_ 1 
dt 2 



u)( 1 — d 2 ) 1 r](u h — u)dxj 
h ~ u )(l — d x )~ 1 fj(—d x )(7r(u h ) — n(u))dx 


1 



(94) 


Let us make two comments on (|94|) : Like in Lemma [T] we use physics notation 
in the sense that an operator acts on all the terms to its right, e. g. in the 
above expression (1 — <9^) -1 7)(— dl)(n(u h ) — 7r(u)) = (l — d 2 )- 1 [fj(—d 2 )(7r(u h ) — 
7 r(u))]. The last term in (l94j) . which comes from the quadratic variation of the 
white noise in time, cf. Lemma [1] for a heuristic discussion, assumes this form 
because ^exp(— \x — y |) is the (translation-invariant) kernel of the operator 
(i - S3- 1 , so that y{x)\ exp (—\x — y\)rj(y ) is the kernel of the operator 
7/(1 — d 2 )~ l fj, so that the quadratic variation is indeed given by 



x (5((x+h) — (y+h)) — S((x+h ) — y) — 8{x — (y+h)) + 8(x — y))dxdy, 

where the spatial Dirac distributions come from the spatial white noise 
£, S p atj more precisely, they represent the covariance ((^ pat - t spa t)(x)(^ pat - 
Zs P at)(y))i of the increment ^ pat - £ spat . 

We integrate (19T)1 against the weight t + 1 in time over t G (—1,0). This 
yields (j§2|) once we establish the following three estimates: The following 
estimate on the quadratic variation 



( 95 ) 
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the following bound on the term under the time derivative 

J fj(u h — u)( 1 — dl)~ 1 fj(u h — u)dx < (e^ — l ) 2 J fj 2 (u — c) 2 dx, (96) 

and the fact that “elliptic term” controls the desired term up to the term in 


fj 2 {u h — u) 2 dx < — / f/(u h - u){l - d 2 x ) l fj{-d 2 x )('K{u h ) - n(u))dx 


+ C I rjiu 11 — u)(l — d 2 ) 1 fj[u h — u)dx. 


(97) 


We first address the quadratic variation term (j95]) . Writing 

fj - - ^fj~ h e~ w = fj( 1 - e~ |h| ) + ^fj h - ^V~ h ) 

and performing a discrete integration by parts, we see that this term takes 
the form of 

so that the estimate follows from the elementary estimate J(fj h — rf) 2 dx < 
h 2 J ( d x fj) 2 dx. 

We note that by duality, the estimate of the time-derivative term 
equivalent to 


is 


J Cv(u h - u)dx < (e w - 1) (J (C 2 + (d x ( ) 2 dx J r) 2 (u - cfdxj , 
which follows by discrete versions of integration by parts and Leibniz’ rule 

J Cv(u h ~ u)dx = j (( C h - C )fj + C h (fj~ h - v)) 0 - c)dx , 

Cauchy-Schwarz’ inequality, the standard estimate 

J ( c~ h - C fdx < h 2 J ( d x () 2 dx , (98) 

and the following property of our cut-off function with exponential tails 
\v~ h (x) -fj(x)\ 

= exp(-i\/x 2 + 1)| exp (^Vx 2 + 1 - ^V( x ~ ^) 2 + 1) ~ 1| 

< fj{x)\ exp(-^y-) — 1|. 
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Let us finally address the elliptic term (1971) . To this purpose we write (in our 
physicist’s way of omitting parentheses) 

v(-d 2 x )(n(u h ) - tt(u)) 

= (1 - dl)(n(u h ) - 7r(u))fj + 2d x (n(u h ) - n(u))d x fj 
-(n(u h )-n(u))(l-dl)fj, 

so that by the symmetry of (1 — (9 2 ) -1 (already used for ([Ml) ) 

J fj{u h - u) (1 - dl)- 1 fj(-dl)(7r(u h ) - ir{u))dx 

= j fj 2 (u h — u)[rr(u h ) — n(u))dx 

-2 J ( d x fj)(7r(u h ) - 7r(u))d x (l - d x )~ l fj{u h - u)dx 
~ J ((1 - 8%)fj)(n(u h )) - tt(«))(1 - dl)- 1 !j(u h - u)dx. 

Using that the operators d x {l — d 2 )~ 5 and (1 — d 2 )~^ have operator norm 
1 w. r. t. to L 2 , we deduce the inequality (where we use the abbreviation 
n h — tt := Tr(u h ) — tt(u)) 

J f)(u h -u)( 1 - dD^fji-dDi7T h - ir)dx 


> j fj 2 (u h — u)(n h — n)dx 

-(2( - *?dx) 2 + (/((! — - ’rfdx)' 

x ( f fj(u h — u)(l — ^.)~ 1 fj(u h — u)dx J . (99) 


By the monotonicity properties ([2D of tt, this yields 
I fj{u h - u)( 1 - d 2 )~ 1 fj(—d 2 )(Tr(u h ) - n(u))dx 
> \ J rj 2 (u h — u) 2 dx 

— ^2( J \d x fj) 2 (u h - u) 2 dx) 2 + (j ((l - d 2 x )fi) 2 (u h - u) 2 dx)' 
xf j fj(u h - u)(l - dl)- 1 ^ - u)dx) . (100) 
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(101) 


Our smoothing out of the exponential cut-off function ensures 

\d x fj\ + \d 2 x fj\ < fj, 

which allows us to use Young’s inequality in order to arrive at (l97|h 

Proof of Lemma [3j We will establish this lemma in the strengthened 
version with the bulk average J r]udxdt replaced by the surface average 
c := f rjudx\ t= _i. To this purpose we rewrite (P in form of 

d t (u - c) = -(-dl)(n(u) - 7r(c)) + f. 

As in Lemma [2J we replace 77 by r) 2 ~ 77 in the statement of this lemma, with 
77 being the mollified version of 772 , cf. (l93lh By the martingale argument we 
have like in Lemma [T] cf. ( fTJl ) . 

lth{ f -dl)~ l; n(u-c)dx^ 

= -( f V{u ~ c )(! - - n(c))dx ^ + ^J ^tfdx. 

We smuggle in an exponential term in the time variable with a rate T C 1 
to be adjusted later: 

^exp(-^)^ j fj(u - c)(l - dl)- 1 ^ - c)dx^ 

= — exp(— J fj(u - c)(l - dly'rjiu - c)dx 

+ J fj(u^c)(l-dly 1 fj(-dl){n{u)-n{c))dx^ + ^ J rfdx}. 

Lemma [3] will follow from integration over t G (0,1) of this identity, using 
the obvious estimates on the quadratic variation term 

J fj 2 dx < 1, 

and on the term under the time derivative 

J fj(u — c)(l — d 2 )^ 1 ^^ — c)dx < J fj 2 (u — c) 2 , 

once we show that the elliptic term controls the desired term for T sufficiently 
small: 

J fj(u - c)( 1 - a 2 ) _1 77(-c> 2 )(7r(M) - 7r(c))cLc 
^ J fj(u - c)(f - d 2 )- 1 ?%u - c)dx. (102) 
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The argument for this estimate (I102|) on the elliptic term follows the lines of 
the one in Lemma [2 Replacing the couple (u h ,u) from there by (u,c), we 
arrive at 


J rj(u - c)(l - d%) 1 fj(—dl)(n(u) — n(c))dx 

> xJ?(u-cfd X 

- (2 ( J ( d x fj) 2 (u - c) 2 dx) 5 + ( J ((1 - d 2 x )fi) 2 (u - cfdx) 
x ( j fj(u — c)(l — d 2 )~ l fi{u — c)dx\ . (103) 


Appealing to the estimates fjlOljl of the smoothened exponential cut-off r / 
and Young’s inequality, we obtain (11021) for a sufficiently large T. 


Proof of Lemma [TJ We fix an r < 1 and note that the statement of this 
lemma follows from 




(104) 


where (•),. denotes the expectation w. r. t. to the white noise restricted to 
(t,x) G (—r 2 ,0) x M, just by taking the expectation w. r. t. to (-)i. By the 
scale invariance (j5J) & (J5]), it is thus sufficient to establish the above for r = 1 . 
We shall replace the exponential averaging function 77 by its mollified version 

fj(x) — —exp(— Vx 2 + 1 ) with Cq := [ exp(— Vx 2 + 1 )dx, 

c 0 J 

noting that 77 ~ r) and pointing out the slight difference to Lemmas [2] and EJ 
cf. (I93j) . Indeed, 77 ~ 77 is enough to replace 77 by 77 on the r. h. s. of (1104)1 : 
for the 1 . h. s. this follows the L 2 -average in time of the estimate 


rjudx — / rjudx 


(77 — 77 ){u — / rju)dx 


< 




1 

2 


Hence with the abbreviation U(t) := / r]udx we need to show that 

((/ (U — J- U) 2 dxdt^ 2 \j < 1 “I - ( ( y / v( u ~ U) 2 dxdt^ 2 ^)(105) 
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After these preparations, we note that we may rewrite equation ([ID in form 
of 


d t u = d 2 x {i r(u) - vr(f/)) +f, 

From this we deduce the stochastic ordinary differential equation 


d t 


rjudx 


— n(U))dlfjdx + ad t W, 


where W is a standard temporal Wiener process and the variance is given by 


a 


2 . 


rjdx ~ 1. 


(106) 


We use the differential equation in its time-integrated version 



aW)) 2 dt 


i -i 


(TT[U 


7r( U))d 2 x fjdx ) 


2 

dt. 


Thanks to the Lipschitz continuity of 7r, Q, and the fact that due to our 
mollification fj of the exponential averaging function, we have \d 2 fj\ < f/, this 
turns into the estimate 



vW)fdt < f 


rj(u — U) 2 dxdt. 


By Poincare’s inequality (with vanishing mean value) and the triangle in¬ 
equality, and appealing to (11061) . this turns into 

/ 0 pO pO pO pO p 

(U- J Ufdt < J (W - j Wfdt + J J fj{u- Ufdxdt. 

By Jensen’s inequality and the defining properties on the quadratic moments 
of the Brownian motion, this implies 

/*0 />Q 1 /tQ n 

< (j (w- J Wfdty + ((J J rj(u- Ufdxdt) 1 *^ 

~ 1 + ((/ J v(u-U) 2 dxdt)*^, 

which is (1 105p . 
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Proof of LemmaO First of all, the observable sup aoG r A1 i sup( t a ,) g (_ 10 ) xR g(g h — 
g) 2 is independent from the noise £ outside of (—1,0) x M, so that we can 
replace the average (-)i by (•). 

We start with the representation formula 



(107) 


where G(t,x ) := -^=exp(—^-) denotes the heat kernel for d t — d 2 . We 
simultaneously consider also 



7-1 J 


and argue that 


{(g(a 0 , t, x ) - g(a 0 , s, y)) 2 ) < y/\t - s\ + \x - y\ (108) 


(( d ao g(a 0 , t, x) - d ao g(a 0 , s, y)f) < \'\t -s\ + \x-y\ (109) 

for all (t, x), (s,i/)6RxR and all ao € [A, 1]. 


Because of the initial conditions and symmetry, we may w. 1. o. g. assume 
that — 1 < s < t. Using the defining property ((/ (^dxdt ) 2 ) = f ( 2 dxdt for a 
test function ( of white noise (that is, (C(t', x')£(s', y')) = 8{t' — s')5{x' — y') 
in the rough but efficient physics language) we get an explicit expression for 
the covariances of these Gaussian fields. In the case of g this is 



' '-s+2 




( 110 ) 


where we used the semi-group property of t i— G(a 0 t, •) in the middle identity. 
In the of d ao g, i.e. the case where the integral kernel is given by G(a 0 t,x ) 
the semi-group property has to be replaced by 
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Using this formula, we obtain as in (II101) 


(d ao g(a 0 ,t, x)d ao (a 0 , s,y)) 


1 

8 


rt+s+2 


( a 2 — (t — s) 2 )<9f G(a 0 cx, x — y)da. 


' t—S 


We now pass from covariance to increment. For g we get 

(( g(ao,t,x) - g(a 0 ,s,y)) 2 ) 

= (g 2 (t, +) + ( g 2 (s , y)) - 2 (g(t, x)g(s, y)) 

' 1 /*2(t+l) 2 /»2(s+1) /»(£H-l)+(sH-l) . 

j G(a 0 cr, 0)dcr 


+ - 


’t—S 


rt+s+2 


+ 


(G(aocr, 0) — G(ao<J, x — y))da. 


't—S 


By positivity and monotonicity of G(a 0 cr, 0) in a and G(a 0 a, 0) > G(a 0 cr,z), 
this yields the inequality 


(( g(ao,t,x ) - g(a 0 ,s,y)) 2 ) 


f*t—S 


< 


< 


G(a 0 cr, 0 )da 


(G(a 0 a, 0) — G(a 0 cr, x — y))da 


f*t—S 


G(aocr, 0)da + \x — y\ / (G(ao<r, 0) — G(ao<x, l))da, 


where we used the scale invariance of G(aocr, z) in the second step. This in¬ 
equality implies (I108[) in this case because of G(a 0 cr, 0) < a~ 2 and G(a 0 cr, 0) — 

13 

G(a 0 a, 1) < min{cr _ 2 ; cr _ 2 |. Similarly we get for d ao g 
((d ao g(a 0 , t, x ) - d ao g(a 0 , s, y)) 2 ) 

1 ^ /»2(s+l) /»(£+l)+(s+l) 


< 


» do 8 , 

1 rh+i)+( s +i) 


/ t—S 


a 2 d 2 G(a 0 a : 0)da 


4 
1 
f 4 

rt—s 

*0 


(t — s) 2 dfG(a 0 a, 0 )da 


t—S 

rt+s+2 


(cr 2 — (t — s) 2 )(d 2 G(a 0 a,0) — d 2 G(a 0 a,x — y))da 


t—S 


■»(t+l)+(s+l) 


a a 2 da + (t — sY 


a 2 da 


' t—S 


a (dfG(a 0 a, 0) — dfG(a 0 a , x — y))da, 
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so that the desired estimate (11091) for d ao g follows as well. 

We now apply Kolmogorov’s continuity theorem to g and d ao g\ for the con¬ 
venience of the reader and because of its similarity to the proof of the main 
result of the paper, we give a self-contained argument, to fix notation in the 
case of g. We first appeal to Gaussianity to post-process (11081) . which we 
rewrite as 

(■^(s (*, x ) - y )) 2 ) < 1 provided \t - s\ < 3 R 2 , \x — y\ < R 

for a given scale R. We note that from (1107)1 we see that the properties of 
being Gaussian and centered transmits from £ to ^=( g(t,x ) — g(s,y)), so 
that by the above normalization we have 

( exp x ) — ^( S ’^)) 2 )) ~ 1 for I*- s l < 3i? 2 , \ x -y\ < R. (HI) 

Our goal is to estimate exponential moments of the local Holder-norm 
[</]«,(—i,o)x(—i,i) := sup sup \g(t, x) — g(s, y)\, 

.RE(0,1) ft (t,x),(s,y)e( — l,0)x( —1,1) 
sJ\t-s\+\x-y\<R 

which amounts to exchange the expectation and the supremum over (t,x), 
(s, y) in (lllip at the prize of a decreased Holder exponent a < 1 To this 
purpose, we now argue that for a > 0, the supremum over a continuum can 
be replaced by the supremum over a discrete set: For R < 1 we define the 
grid 

r R = [-1, o] x [-1, i] n ( r 2 z x rz) 


and claim that 

[fl']a 1 (-l,0)x(-l,l) (112) 

~ sup 7 F sup \9(t,x) -g(s,y)\ =: A, 

R -ft (t,x),(s,y)e r R 

\t—s\<SR 2 ,\x—y\<R 

where the first sup runs over all R of the form 2 _JV for an integer N > 1. 
Hence we have to show for arbitrary (t,x), ( s,y ) G (—1,0) x (—1,1) that 

I g(t,x) -g(s,y )| < A(v / I t - s\ + \x-y\) a . (113) 

By density, we may assume that (t, x), (s, y) G r 2 Z x rZ for some dyadic r = 
2 _JV < 1 (this density argument requires the qualitative a priori information 
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of the continuity of g, which can be circumvented by approximating £). By 
symmetry and the triangle inequality, we may assume s < t and x < y. 
For every dyadic level n = N, N — 1, • • • we now recursively construct two 
sequences (t n ,x n ) ( s n ,y n ) of space-time points (in fact, the space and time 
points can be constructed separately), starting from (£#,£#) = (t,x) and 
(sj\D 2 /jv) = (s,y), with the following properties 

a) they are in the corresponding lattice of scale 2~ n , i. e. (t n , x n ), (s n , x n ) G 
(2~ n ) 2 Z x 2~ n Z, 

b) they are close to their predecessors in the sense of | t n — t n+ i\, |s n — 
Sn+i| < 3(2^ (n+1) ) 2 and \x n - x n+1 \, \y n - y n+1 \ < 2~ (n+1) , so that by 
definition of A we have 

\g{t n ^X n ) yi^n+l 7 ^n+l) | ! Vn) 9 (^n+l j Vn+l ) | ^ A(2 ^ , 

( 114 ) 

and 


c) such that \t n — s n \ and \x n — y n \ are minimized among these points. 
Because of the latter, we have 

(fM, x M ) = (%, Vm) for some M with 2~ M < max{ y/\t — s|, \x — y |}, 
so that by the triangle inequality we gather from (11141) 

M f O — M \ Oi 

\g(t,x)-g(s, V )\< Y. A(2-l" +1 >)“ < 

n=N -1 


which yields ( 1 11 B 1 ) . 

Equipped with ( 11121 ) . we now may upgrade ( 11111 ) to 


( ex P (^Ma,(-i,o)x(-i,i))> % 1 


K C 


(115) 


for a < Indeed, (I112j) can be reformulated on the level of characteristic 
functions as 

^ £ rR l(j(s(.t,x)-g( S ,y)Y > ^ 4 L_), 

where as in (11121) R runs over all 2 _JV for integers N > 1. Replacing the 
suprema by sums in order to take the expectation, we obtain 

(%]*,(—i,0)x(—1,1) >M))<J 2 x )- 9 (s, y)f > C7 ^ 2 J ). 

R ( t,x),{s,y) 
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We now appeal to Chebyshev’s inequality in order to make use of (11111) : 


(^([5 , ]a,(-l,0)x(—1,1) — ^0) 

~ E E eXp ( C ) 

R (■ t,x),(s,y ) 

~ 2^W eM -cW^ ) 


R<1,M>1 M 1 , 1,1 

< <«p(-c)L7ji ex P(-c ( lF5s 

R 


u , .AT 
l))<exp(-—), 


where in the second step we have used that the number of pairs (t,x), (s, ?/) 
of neighboring lattice points is bounded by and in the last step we have 
used that stretched exponential decay (recall 1 — 2a > 0) beats polynomial 
growth. The last estimate immediately yields (11151) . 

It remains to post-process (II 15j) and the same bound for d ao g. We only need 
these bounds for second moments but with the spatial origin replaced by any 
point x: 

( [d] a,(-1,0) x (x-l,x+l) ) + ([dao0]a,(-l,O)x(*-l,a+l)) £ 1 for all X 6 R. (116) 

We use these bounds in the embedding H 1 L°° to get 


( Slip ^(uO)')■)]«,(—l,0)x(®—l,a:-|-l)} 
clqG [A, 1] 


-1 


-1 


^ J [^]q!,(— 1,0)X (x— 1,X+1) .L ^ da ° 9 ^ a,(—1,0) x (pc— l,cc+l) ^ 

< 1 for all 


(117) 


To obtain the statement of the lemma in form of 

(sup sup rj(g h -g) 2 )<mm{\h\ 2a ,l} (118) 

[A, 1] (-1,0) xR 

we distinguish the cases \h\ < | and \h\ > 1. In the first case we have for 
any a 0 

sup 7 l(g h -g) 2 < Vexp(-|a:|) sup (g h - g) 2 

(-l°) xR x&L (-l,0)x(x-|,x+i) 

kl<3 

< \h\ 2a ^exp(-\x\)[g}l t{ _ 10)x{x _ hx+1) , 

rrEZ 
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from which (jl 18j) follows by taking first the supremum over a 0 in each term 
and then the expectation and inserting (111711 . In case of \h\ > 1, we proceed 
via 


sup r](g h - gf 

(—l,0)xR 

< sup riq 2 + sup ri~ h q 2 

(—l,0)xR (—l,0)xR 


^ 5^(exp(-|x|) +exp( 

irG2Z 

g{t=- 1)=0 

< 5^(exp(-|x|) +exp( 

:cE2Z 


x — h D) sup g 2 

(—1,0) x (a:—l,a;+l) 

X~h\)) SUp [s]a I (-l l 0)x(*-l ia H-l)- 

(—l,0)x(a;—l,x+l) 


Proof of Corollary |TJ We start by defining a modified local Holder 
norm, based on the D(u,r). For R > 0 set 

r R = n[-i, l] x [-1, i] n (r 2 z x nz) . (119) 

Then we define the modified Holder semi-norm 

Ha — sup sup ~l~D(u {t ’ x) ,R), 

R (t,x)er H R 


where for a space-time point (t, x) we write u^' x \t, x) = u(t + £, x + x) and 
the first supremum is taken over all R = 2 ~ N for integer N > 1. We claim 
that 

Ma < Ha- (120) 

This claim is established below, but first we proceed to prove Corollary Q] 
assuming that (1120(1 holds. 

To this end, fix a < a' < 1. From (J2J) we get for any R G (0,1) and 
1 < a < oo 


= (/(exp ((T,£>(«, K)f^) > exp (p.R"-“') 2 3))} 
< exp ( - («T/?”-”') 2 ^) ( “P ( R )j ^ ) ) 


(fTol > 

< exp 


,. 9^ 
a—a \ z 


(, oR a ~ a ') 


< exp 


e. Q n 

a«' 


C 



( 121 ) 
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for a suitable constant C. In the third line we have used Chebyshev’s in¬ 
equality. By translation invariance the same bounds holds if u is replaced by 
u( t,x ' for any space-time point (t,x). Therefore, we get 


'([»].>»)) < EE 


a 


R (t,x)eR r 


{mJ-Ginj 

< 


< 


cr 


exp - 


exp 


j R ~ 3 ex P (- {a ~ a>) 

R 


n °0 

CT o' 


C 


This fast decay of the tails of the distribution of the [u] a implies the desired 
integrability property. 

It remains to establish the bound ( 11201 ) . We rely on Campanato’s character¬ 
ization of Holder spaces [ 8 j Theorem 5.5] which in our current context states 
that [u] a is controlled by 


sup sup — (-f -f fu ^ to,x °) 




u 


(to,xo) 


1 

2 


( 122 ) 


To see that [w] a controls this norm, we observe that for r > 0 satisfying 
2 -N -2 < r < 2~ n ~ 1 any arbitrary (t 0 ,x 0 ) G [—1,1] x [—1,1] can be well 
approximated in A 2 -jv , in the sense that there exists (i, x) G A 2 -jv satisfying 
\xq — x\ < 2~( Ar+1 ) and 11 0 — t\ < 2 -2(Ar+1 b Then we get, for R = 2~ N using 
the definition of 77 # 



Therefore, we can conclude that [wJq, controls the Campanato norm defined 
in (11221) and the proof of (11201) (and therefore the proof of Corollary Q]) is 
complete. 
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